False Discovery Rate Control via Data Splitting
نویسندگان
چکیده
Selecting relevant features associated with a given response variable is an important problem in many scientific fields. Quantifying quality and uncertainty of selection result via false discovery rate (FDR) control has been recent interest. This article introduces data-splitting method (referred to as “DS”) asymptotically the FDR while maintaining high power. For each feature, DS constructs test statistic by estimating two independent regression coefficients data splitting. achieved taking advantage statistic’s property that, for any null its sampling distribution symmetric about zero; whereas positive mean. Furthermore, Multiple Data Splitting (MDS) proposed stabilize boost Surprisingly, under control, MDS not only helps overcome power loss caused splitting, but also results lower variance proportion (FDP) compared all other methods consideration. Extensive simulation studies real-data application show that are robust unknown features, easy implement computationally efficient, often most powerful ones among competitors especially when signals weak correlations or partial high. Supplementary materials this available online.
منابع مشابه
False Discovery Rate Control via Debiased Lasso
We consider the problem of variable selection in high-dimensional statistical models where the goal is to report a set of variables, out of many predictors X1, . . . , Xp, that are relevant to a response of interest. For linear high-dimensional model, where the number of parameters exceeds the number of samples (p > n), we propose a procedure for variables selection and prove that it controls t...
متن کاملTesting Jumps via False Discovery Rate Control
Many recently developed nonparametric jump tests can be viewed as multiple hypothesis testing problems. For such multiple hypothesis tests, it is well known that controlling type I error often makes a large proportion of erroneous rejections, and such situation becomes even worse when the jump occurrence is a rare event. To obtain more reliable results, we aim to control the false discovery rat...
متن کاملPrivate False Discovery Rate Control
We provide the first differentially private algorithms for controlling the false discovery rate (FDR) in multiple hypothesis testing, with essentially no loss in power under certain conditions. Our general approach is to adapt a well-known variant of the Benjamini-Hochberg procedure (BHq), making each step differentially private. This destroys the classical proof of FDR control. To prove FDR co...
متن کاملCommunication-Efficient False Discovery Rate Control via Knockoff Aggregation
The false discovery rate (FDR)—the expected fraction of spurious discoveries among all the discoveries—provides a popular statistical assessment of the reproducibility of scientific studies in various disciplines. In this work, we introduce a new method for controlling the FDR in meta-analysis of many decentralized linear models. Our method targets the scenario where many research groups—possib...
متن کاملAdaptive False Discovery Rate Control for Heterogeneous Data
Efforts to develop more efficient multiple hypothesis testing procedures for false discovery rate (FDR) control have focused on incorporating an estimate of the proportion of true null hypotheses (such procedures are called adaptive) or exploiting heterogeneity across tests via some optimal weighting scheme. This paper combines these approaches using a weighted adaptive multiple decision functi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Statistical Association
سال: 2022
ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']
DOI: https://doi.org/10.1080/01621459.2022.2060113